Your selections:
Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples With On-Policy Experiences
- Banerjee, Chayan, Chen, Zhiyong, Noman, Nasimul
Improving sample efficiency in deep reinforcement learning based control of dynamic systems
Optimal Actor-Critic Policy With Optimized Training Datasets
- Banerjee, Chayan, Chen, Zhiyong, Noman, Nasimul, Zamani, Mohsen
Are you sure you would like to clear your session, including search history and login status?